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Abstract 



Binary relations are an important abstraction arising in many data 
representation problems. The data structures proposed so far to repre- 
sent them support just a few basic operations required to fit one particular 
application. We identify many of those operations arising in applications 
and generalize them into a wide set of desirable queries for a binary rela- 
tion representation. We also identify reductions among those operations. 
We then introduce several novel binary relation representations, some sim- 
ple and some quite sophisticated, that not only are space-efficient but also 
efficiently support a large subset of the desired queries. 

1 Introduction 

Binary relations appear everywhere in Computer Science. Graphs, trees, in- 
verted indexes, strings and permutations are just some examples. They also 
arise as a tool to complement existing data structures (such as trees [5] or 
graphs [2]) with additional information, such as weights or labels on the nodes 
or edges, that can be indexed and searched. Interestingly, the data structure 
support for binary relations has not undergone a systematic study, but rather 
one triggered by particular applications. We aim to start such a study in this 
article. 

Let us say that a binary relation 1Z relates objects in [l,fi] with labels in 
[1,<t], containing t pairs out of the no possible ones. We focus on space-efficient 

*An early version of this article appeared in Proc. LATIN 2010. Partially funded by Fonde- 
cyt grant 1-110066, Chile (first and third authors). Funded in part by Google U.S. /Canada 
PhD Fellowship Program and David R. Cheriton Scholarships Program (second author). 



123456789 

A . . 1 

B 11.. 

C . . . 1 . 1 . 1 . 

D . 1 

E 1 . . 1 1 . . . . 

F 1 

G . . . . 1 . 1 . . 

H 1 1 

Figure 1: An example of binary relation. 



representations considering a simple entropy measure, 

H(K) = Ig (7) = ilg^ + 0(t) 

bits, which ignores any other possible regularity. Figure [T] illustrates a binary 
relation (we identify labels with rows and objects with columns henceforth). 

Previous work focused on relatively basic primitives for binary relations: 
extract the list of all the labels associated with an object or of all the objects 
associated with a label (an operation called access), or extracting the j-th such 
element (an operation called select), or counting how many of these are there 
up to some object/label value (called operation rank). 

The first representation specifically designed for binary relations [5 supports 
rank, select and access on the rows (labels) of the relation, for the purpose 
of supporting faster joins on labels. The idea is to write the labels of the pairs 
in object-major order and to operate on the resulting string plus some auxiliary 
data. This approach was extended to support more general operations needed 
for text indexing [16] . The first technique [5] was later refined [6] into a scheme 
that allows one to compress the string while still supporting the basic operations 
on both labels and objects. The idea is to store auxiliary data on top of an 
arbitrary representation of the binary relation, which can thus be compressed. 
This was used to support labeled operations on planar and quasi-planar labeled 
graphs [2]. 

Ad-hoc compressed representations for inverted lists [34] and Web graphs [To] 
can also be considered as supporting binary relations. The idea here is to 
write the objects of the pairs, in label-major order, and to support extracting 
substrings of the resulting string, that is, little more than access on labels. One 
can add support for access on objects by means of string select operations |14j . 
The string can be compressed by different means depending on the application. 

In this paper we aim at settling the foundations of efficient compact data 
structures for binary relations. In particular, we address the following points: 

• We define a large set of operations of relevance to binary relations, widely 
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extending the classic set of rank, select and access. We give a number 
of reductions among operations in order to define a core set that allows 
one to efficiently support the extended set of operations. 

• We explore the power of the reduction to string operators [5] when the op- 
erations supported on the string are limited to rank, select and access. 
This structure is called BinRel-Str, and we show it achieves interesting 
bounds only for a reduced set of operations. 

• We show that a particular string representation, the wavelet tree |24j . 
although not being the fastest one, provides native support for a much 
wider set of operations within logarithmic time. We call BinRel-WT 
this binary relation representation. 

• We extend wavelet trees to generalized wavelet trees [18], and design new 
algorithms for various operations that take advantage of their larger fan- 
out. As a result we speed up most of the operations within the same space. 
This structure is called BinRel-GWT. 

• We present a new structure, binary relation wavelet tree (BRWT), that is 
tailored to represent binary relations. Although the BRWT gives weaker 
support to the operations, it is the only one that approaches the entropy 
space H(1Z) within a multiplicative factor (of 1.272). 

For the sake of brevity, we aim at the simplest description of the operations, 
ignoring any practical improvement that does not make a difference in terms of 
asymptotic time complexity, or trivial extensions such as interchanging labels 
and objects to obtain other space/time tradeoffs. 

2 Compact Data Structures for Sequences 

Given a sequence S of length n, drawn from an alphabet £ of size <r, we define 
the following queries (omitting S if clear from context): 

• rank a (S', i) counts the occurrences of symbol o G E in S[l, i]. 

• select a (S', j) finds the position of the j'-th occurrence of symbol a G S in 
S. 

• access^, i) = S[i]. 

The solutions in the literature are quite different depending on the alphabet 
size, a. 
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2.1 Binary Sequences 



For the special case £ = {0,1}, there exist representations using o(n) bits on 
top of a plain representation of S, and answering the three queries in constant 
time [T3] • The extra space can be made as low as 0(nlglgn/lgn), which is 
optimal |21j . This extra data structure on top of a plain representation of S is 
called an index. 

If we are allowed, instead, to represent S in a specific way, then one can 
compress the sequence while still supporting the three operations in constant 
time. The overall space can be reduced to nH (S) + o{n) bits [53]. Here H (S) 
is the zero-order entropy of sequence S, defined as 



where n a is the number of occurrences of symbol a in S. The o(n) extra space 
on top of the entropy can be made as small as 0(n/lg c n) for any constant c 



2.2 Sequences over Small Alphabets 

If a = 0(lg £ n), for a constant < e < 1, it is still possible to retain the space and 
time complexities of binary sequence representations |18j . The main idea is that 
we only need a compressed sequence representation that provides constant-time 
access, or more precisely, that can access 0(lg ff n) consecutive symbols from 
S in constant time (this is achieved by extending similar compressed bitmap 
representations [55]V 

Then, in order to support rank a and select a , we act as if we had an explicit 
bitmap B a [l,n], where B a [i] = 1 iff S[i] = a. Then rank a (5 l , i) = ranki(_B a ,i) 
and select (S', j) — selecti(_B a , j). We store only the rank/select index of 
each B a , and can solve rank/select on B a by extracting any desired chunk 
from S. The only difference is that we cannot access O(lgn) contiguous bits 
from B a in constant time, but only C^lg^ n). 

As a result, the index for each B a requires 0{n lglgn/lg^n) bits of space. 
Added over all the a £ S, the total space for the indexes is 0(na lg lg n/ lg CT n) = 



2.3 General Sequences and Wavelet Trees 

The wavelet tree [53] reduces the three operations on general alphabets to those 
on binary sequences. It is a perfectly balanced tree that stores a bitmap of 
length n at the root; every position in the bitmap is either or 1 depending on 
whether the symbol at this position belongs to the first half of the alphabet or to 
the second. The left child of the root will handle the subsequence of S marked 
with a at the root, and the right child will handle the Is. This decomposition 
into alphabet subranges continues recursively until reaching level [lg a] , where 
the leaves correspond to individual symbols. We call B v the bitmap at node v. 
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Figure 2: Example of a wavelet tree for the sequence EHDHACEEGBCBGCF. 



The access query S[i] can be answered by following the path described for 
position i. At the root v, if B v [i] = 0/1, we descend to the left/right child, 
switching to the bitmap position rank /i(-B„, i) in the left/right child, which 
then becomes the new v. This continues recursively until reaching the last 
level, when we arrive at the leaf corresponding to the desired symbol. Query 
rank a (S l , i) can be answered similarly to access, except that we descend ac- 
cording to a and not to the bit of B v . We update position i for the child node 
just as before. At the leaves, the final bitmap position i is the answer. Query 
select a (5', j) proceeds as rank Q , but upwards. We start at the leaf representing 
a and update j to select / 1 (B v , j) where v is the parent node, depending on 
whether the current node is its left/right child. At the root, position j is the 
result. 

If the bitmaps B v are represented in plain form (with indexes that support 
binary rank/select), then the wavelet tree requires n\ga + o[n)\go bits of 
space, while answering all the queries in 0(\ga) time. If the bitmaps B v are 
instead represented in compressed form [32], the wavelet tree uses nHo(S) + o(n) 
bits and retains the same time complexities. Figure [2] illustrates the structure. 
Wavelet trees are not only used to represent strings [18], but also grids [TO] , 
permutations [8], and many other structures. We refer in particular to a recent 
article 20; where in particular some (folklore) capabilities are carefully proved: 
(1) any range of symbols [a, 0] is covered by k — 0(\g(f3 — a + 1)) wavelet tree 
nodes, and these can be reached from the root by traversing 0(k + lgu) nodes. 

A way to speed up the wavelet tree operations is to use generalized wavelet 
trees [18]. These are multiary wavelet trees, with arity // = 0(lg e n), for a con- 
stant < e < 1. Bitmaps B v are replaced by sequences S v over a (small) alpha- 
bet of size /i. All the operations on those sequences are still solved in constant 
time, but now the wavelet tree height is reduced to 0(lg M er) = 0(lgcr/lglgro), 
and thus this is the complexity of the operations. The space can still be bounded 
by nH (S)+ o{n) [23]. 

Wavelet trees are not the only sequence representation achieving basically 
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nH (S) bits of space [221 S]- The best current alternative [J uses nH (S) + 
o(nHo(S)) + o(n) bits and solves the three queries within time O(lglgcr). This 
is preferable when a is large. 

2.4 Range Minimum Queries (RMQs) 

The Range Minimum Query (RMQ) operation on a sequence S[l,n] was not 
listed among the basic ones, but we cover it because we will make use of it in 
Lemma It is defined as RMQ(S', — a,rgmm i<k< j S[k], that is, it returns 
the position of the minimum value in a range S[i,j]. If there are more than one, 
it returns the leftmost minimum. 

It is possible to solve this query in constant time using just In + o(n) bits 
of space, without even accessing S itself [15] . 

3 Operations 

3.1 Definition of operations 

We now motivate the set of operations we define for binary relations. Their full 
list, formal definition, and illustration, are given in|X] 

One of the most pervasive examples of binary relations are directed graphs, 
which are precisely binary relations between a vertex set V and itself. Extracting 
rows or columns in this binary relation supports direct and reverse navigation 
from a node. To support powerful direct access to rows and columns we define 
operations obj_accl(a,x,y), which retrieves the objects in [x,y] related to la- 
bel a, in arbitrary order, and the symmetric one, lab_accl(a, fi, x). In case we 
want to retrieve the pairs in order, obj_minl(a, x), which gives the first object 
> x related to label a, can be used as an iterator, and similarly labjminl(a, x). 
These operations are also useful to find out whether the link (a, x) exists. Note 
that adjacency list representations only support efficiently the retrieval of di- 
rect neighbors, and adjacency matrix only support efficiently the test for the 
existence of a link. 

Web graphs, and their compact representation supporting navigation, have 
been a subject of intense research in recent years [TTJ [15] (see many more 
references therein). In a Web graph, the nodes are Web pages and the edges 
are hyperlinks. Nodes are usually sorted by URL, which not only gives good 
compression but also makes ranges of nodes correspond to domains and sub- 
directories For example, counting the number of connections between two 
ranges of nodes allows estimating the connectivity between two domains. This 
count of points in a range is supported by our operation rel_num(o;, fj, x, y), 
which counts the number of related pairs in [a, /3] x [a;,y]. The individual links 
between the two domains can be retrieved, in arbitrary order, with operation 
rel_acc(a, j3, x, y). In general, considering domain ranges enables the analysis 
and navigation of the Web graph at a coarser granularity (e.g., as a graph of 

1 More precisely, we have to sort by the reversed site string concatenated with the path. 
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hosts, or institutions). Our operations obj_acc(a, /3, x, y), which gives the ob- 
jects in [x,y] related to a label in [a, f3], extends obj_accl to ranges of labels, 
and similarly lab_acc(a, /3, x, y) extends lab_accl. Ordered enumeration and 
(coarse) link testing are supported by operations obj_min(o!, 0, x), which gives 
the first object > x related to a label in [a, /3], and similarly lab_min(a, x, y) 
for labels. 

A second pervasive example of a binary relation is formed by the two- 
dimensional grids, where objects and labels are simply coordinates, and where 
pairs of the relation are points at those coordinates. Grids arise in GIS and many 
other geometric applications. Operation rel_num allows us to count the number 
of points in a rectangular area. A second essential operation in these applications 
is to retrieve the points from such an area. If the retrieval order is not important, 
rel_acc is sufficient. Otherwise, operation rel_acc_labjmaj (x, y, a, z) serves as 
an iterator to retrieve the points in label-major order. It retrieves the first 
point in [a,a\ x [z,y], if any, and otherwise the first point in [a+l,er] x [x,y]. 
Operation rel_acc_obj_maj (a, /?, 7, x) is similar, for object-major order. For 
an even more sophisticated processing of the points, rel_sel_lab_maj (a, j, x, y) 
and rel_sel_obj_maj(a!,/3,2;, j) give access to the j-th element in such lists. 

Grids also arise in more abstract scenarios. For example, several text index- 
ing data structures [TH [23 [23 [23 resort to a grid, which relates for example 
text suffixes (in lexicographic order) with their text positions, or phrase pre- 
fixes and suffixes in Lempel-Ziv compression, or two labels that form a rule in 
grammar-based compression, etc. The operations most commonly needed are, 
again, counting and retrieving (in arbitrary order) the points in a rectangle. 

Another important example of binary relations are inverted indexes [34], 
which support word-based searches on natural language text collections. In- 
verted indexes can be seen as a relation between vocabulary words (the labels) 
and the documents where they appear (the objects). Apart from the basic oper- 
ation of extracting the documents where a word appears (obj_accl), a popular 
operation is the conjunctive query (e.g., in Google-like search engines), which 
retrieves the documents where k given words appear. These are solved using a 
combination of the complementary queries obj_rnkl(a, x) and obj_sell(a, x,j) 
[51 17]. The first operation counts the number of points in [a, a] x [1, x], whereas 
the second gives the j-th point in [a, a] x [x,n]. 

Extending these operations to a range of words allows for stemmed and/or 
prefix searches (by properly ordering the words), and are implemented using 
obj_rnk(a, /3, x) and obj_sel(a ) /3,x, j), which extend obj_rnkl and obj_sell 
to ranges of labels. Extracting a column, on the other hand (lab_accl), gives 
important summarization information on a document: the list of its different 
words. Intersecting columns (using the symmetric operations lab_rnkl(a, a;) 
and lab_sell(a, x, j)) allows for analysis of content between documents (e.g., 
plagiarism or common authorship detection). Handling ranges of documents 
(supported with the symmetric operations lab_acc, lab_rnk(a, x, y), and 
lab_sel(a, j, x, y)) allows for considering hierarchical document structures such 
as XML or file systems (where one operates over a whole subtree or subdirec- 
tory). 
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Figure 3: Reductions among operations. 



Similar representations are useful to support join operations on relational 
databases and, in combination with data structures for ordinal trees, to support 
multi-labeled trees, such as those featured by semi-structured documents (e.g., 
XML) (5). A similar technique [2] combining various data structures for graphs 
with binary relations yields a family of data structures for edge-labeled and 
vertex-labeled graphs that support labeled operations on the neighborhood of 
each vertex. For example, operations rel_min_lab_maj and rel_min_obj jnaj 
support the search for the highest neighbor of a point, when the binary relation 
encodes the levels of points in a planar graph representing a topography map [5] • 

The extension of those operations to the union of labels in a given range al- 
lows them to handle more complex queries, such as conjunctions of disjunctions. 
For example, in a relational database, consecutive labels may represent a range 
of parameter values (e.g., people of age between 20 and 40). 

We define other operations for completeness: rel_rnk_lab_maj acts like 
the inverse of rel_sel_lab_maj , and similarly rel_rnk_obj_maj; lab_num and 
objjnum are more complete versions of lab_rnk and obj_rnk; and rel_rnk is a 
more basic version of reljnum. 

3.2 Reductions among operations 

We give a set of reductions among the operations introduced. The results are 
sumarized in the following theorem. 

Theorem 1. For any solid arrow op — » op' in Figure [3j it holds that if op is 
solved in time t, then op' can be solved in time 0(t). For the dotted arrows with 
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associated penalty factors 0(t'), it holds that if op is solved in time t, then op 1 
can be solved in time 0{tt r ). 

Proof. Several reductions are immediate from the definition of the operations 
in [A] (those arrows are in bold in Figure [3]). We prove now the other ones. We 
consider only the reductions for the left side of the figure (operations related to 
labels); the same idea applies for the right side (objects). 

• rel_rnk — > rel_rmm 

rel_num(a, j3, x, y) = reljrnk(a — 1, x — 1) — reljrnk(a! — 1, y) 

— rel_rnk(/?, x — 1) + rel_rnk(a — 1, x — 1). 

• rel_rnk_lab_maj — > rel_num 

reljaum(a, /3, x, y) — rel_rnk_lab_maj (/3, x, y, y) 

— rel_rnk_lab_maj (a — 1, x, y, y). 

• rel_acc — > (lab_accl, obj_accl) 

lab_accl(a, j3, x) — {7, (7, x) E rel_acc(a, /?, x, x)}, 
obj_accl(a, x, y) — {z, (a, z) E rel_acc(a, a, x, y)}. 

• rel_sel_lab_maj — > rel _min_lab jnaj : in order to solve query 
rel_min_lab jmaj (a, x, y, z) we first test if rel_sel_lab_maj (a, 1, z, y) gives 
a pair of the form (a, w), in which case we return it. Otherwise, we return 
rel_sel_lab_maj (a + 1, 1, x, y). 

• rel_min_lab_maj — > rel_acc: to solve rel_acc(a, /3, x, y), we find a first 
point (7,2) = rel_min_lab_maj (a, x, y, x). The next element is obtained 
as (7', z') = rel_min_lab_maj (7, x, y, z + 1) and so on, until we reach the 
first answer with label greater than /?. 

• rel_min_labjnaj — > labjnin: let (7,2) = rel_min_lab_maj (a, x, y, x), 
then lab_min(a, x, y) = 7. 

• lab_min — > lab_acc: we report 7 = lab_min(a, x, y), 7' = lab_min(7 + 
1, x, y), and so on until reaching a result larger than /3. The points reported 
form lab_acc(a, /3, x, y). 

• lab_minl — > lab_accl: similar to the previous reduction. 

Finally, the non-constant time reductions are explained the following way: 

• (rel_rnk_lab_maj , rel_sel_lab jnaj ) works in both ways by doing a binary 
search over the results of the other operation, in the worst case considering 
no elements. 
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Figure 4: Sequence S and bitmap i? for representing the binary relation shown 
in Figure [T] 



• (lab_num, lab_sel) operates the same way as the previous one, but search- 
ing among a elements in the worst case, thus the O(lgcr) penalty. (For 
the objects this becomes O(lgn).) 

□ 

The reductions presented here allow us to focus on a small subset of the 
most difficult operations. In some cases, however, we will present more efficient 
solutions for the simpler operations and will not use the reduction. 

4 Reduction to Strings: BinRel-Str 

A simple representation [5J 116) for a binary relation 1Z formed by t pairs in 
[1, n] x [1, a] uses a bitmap B[l, n + t] and a string S[l, t] over the alphabet [1, a]. 
The bitmap B concatenates the consecutive cardinalities of the n columns of 
the relation, in unary. The string S contains the rows (labels) of the pairs of 
the relation in column (object)-major order. Figure [4] shows the representation 
for the binary relation shown in Figure [T] Barbay et al. [5] showed that an easy 
way to support operations obj_rnkl and obj_sell on the binary relation is to 
support the operations rank and select on B and S, using any data structure 
known for bitmaps and strings (recall Section [2]). Note also that the particular 
case reljnum(l, a, x, y) can be answered in O(l) time using B. In the sequel we 
extend Barbay et al.'s work as much as possible considering our considerably 
larger set of operations. This approach, building only on rank, select and 
access on B and S, will be called BinRel-Str. 

We define some notation used in the rest of the paper. First, we call map(ir) 
the mapping from a column number x to its last element in S: map(a;) = 
ranki(B, selecto(-B, x)). The inverse, from a position in S to its column 
number, is called unmap(m) = rank (_B, select! (i?,m)) + 1. Both mappings 
take constant time. Finally, let us also define for shortness rank c (_B, x, y) — 
rank c (_B, y) — rank c (£>, x — 1). 

Assume our representation of S supports access in time a, rank in time r 
and select in time s. Table [l] shows the complexity achieved for each binary 
relation operation using this approach. As it can be seen, the scheme extends 
nicely only to operations involving one row or one column. In all the other 
cases, the complexities are linear in the lengths of the ranges to consider. As 



10 



the algorithms are straightforward and their complexities uninteresting, we defer 
them to HE] 

Various string representations [22, 4 offer times a, r, and s that are constant 
or log-logarithmic on a. These yield the best time complexities we know of for 
the row- wise and column- wise operations, although these form a rather limited 
subset of the operations we have defined. 

The space used by techniques based on representing B and S (including 
BinRel-WT and BinRel-GWT) is usually unrelated to H(TZ), the entropy of 
the binary relation. Various representations for S[l, t] achieve space tH (S) plus 
some redundancy [2H [221 0]. This is tH (S) = J2 a e[i a] n " ^& n~> wnere n a is 
the number of pairs of the form (a, •) in 1Z. While this can be lower than H(7Z) 
(which shows that our measure H(1Z) is rather crude), it can also be arbitrarily 
higher. For example an almost full binary relation has an entropy H(1Z) close to 
zero, but its tHg(S) is close to na lg a. A clearer picture is obtained if we assume 
that S is represented in plain form using t lg a bits. This is to be compared to 
H(1V) — t lg Tp+0(i)) which shows that the string representation is competitive 
for sparse relations, t = 0(n). 

5 Using Wavelet Trees: BinRel-WT 

Among the many string representations of S we can choose in BinRel-Str 
scheme, wavelet trees |24j turn out to be particularly interesting. Although 
the time wavelet trees offer for a, r and s is O(lgcr), not the best ones for 
large a, wavelet trees allow one to support many more operations efficiently, via 
other algorithms than those used by the three basic operations. We call this 
representation BinRel-WT. Table [T] summarizes the time complexity for each 
operation using BinRel-WT, in comparison to a general BinRel-Str. Next, 
we show how to support some key operations efficiently; the other complexities 
are inferred from Theorem^ 

The first lemma states a well-known algorithm on wavelet trees [27] . 

Lemma 1. BinRel-WT supports rel_rnk(a, x) in Oilga) time. 

Proof. This is rank< Q (S f , map(x)), where operation rank< Q (5,p) counts the 
number of symbols < a in 5[l,p[. It can be supported in time O(lgcr) on 
the wavelet tree of S by following the root-to-leaf branch corresponding to a, 
while counting at each node the number of objects preceding position p that are 
related with a label preceding a, as follows. Start at the root v with counter 
c <— 0. If a corresponds to the left subtree, then enter the left subtree with 
p <— ranko(B v ,p). Else enter the right subtree with c <— c + ranko(B v ,p) and 
p raiiki(B v ,p). Continue recursively until a leaf is reached (indeed, that of 
a), where the answer is c + p. □ 

2 To simplify, in Tabic [l] we omit some complexities that are most likely to be inferior to 
their alternatives. 
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Operation 


BinRel-Str 


BinRel-WT 


rel_rmm(a, /3, x, y) 


0({/3 - a + l)r) 
0((y-x + l)alg/3) 


O(lga) 


rel_rnk(a, x) 


0{ar) 
0{xa lg a) 


O(lga) 


rel_rnk_lab_maj (a, x, y, z) 


O(ar) 

0((y — a: + l)a lg a) 


O(lga) 


rel_sel_lab_maj (a, j, x, y) 


0((<r - Q + l)r + s) 


O(lga) 


rel_min_lab_maj (a, x 1 y, z) 


0((ct - a + l)r + s) 
0((y — a; + l)olg a) 


O(lga) 


rel_rnk_obj _maj (a, /3, 7, x) 


0(08 - a + 1)'') 
O(jjolg /3) 


O(lga) 


rel_sel_obj _maj (a, /3, a;, j) 


0((n -x + l)alg/3) 


0(lgjlg(/9-a + l)lgff) 
O(lgnlger) 


rel_min_obj _maj (a, /3, 7, a;) 


0((/3-a + l)(s + r)) 
0((n — x + l)a lg a) 


O(lga) 


rel_acc(a, /3, a;, y) 


0((P - a + l)r + sfe) 
0{{y — x + l)a lg a + a/e) 


0((fe + l)lg<r) 


lab_num(a, /3, a;, y) 


O((0 - a + l)r) 


0(/3 -a + lgtr) 


lab_rnk(a, a;, y) 


O(ar) 


0(a + lgo-) 


lab_sel(a, x, y) 


0((<r - a + l)r) 




lab_acc(a, /3, a;, y) 


0((/3 - a + l)r) 
0((j/ — a: + l)alg a) 


0((fc + l)lgcr) 


lab_min(a, a;, y) 


0((a - a + l)r) 
0((y - x + l)alga) 


O(lga) 


obj_num(a, /3, a;, y) 


0((y - x + l)o lg a) 


0((y- x + l)\gcr) 


obj_rnk(a, /3, a;) 


0(a;a lg a) 


0(x lg cr) 


obj_sel(a, /3, a;, j) 


0((n — 2: + l)alga) 


0(i lgff) 


obj_acc(a, /3, a;, y) 


0((p - a + l)(r + sk)) 
0{{y — x + l)a lg a) 


0((fe + l)lg<r) 


obj_min(a, /3, a;) 


0((J3 - a + l)(r + s)) 
0((n — x + l)alga) 


O(lga) 


lab_rnkl(a, a;) 


O(olga) 


O(lga) 


lab_sell(a, j, x) 


O(alga) 


O(lga) 


lab_minl(a, x) 


O(alga) 


O(lga) 


lab_accl(a, 0, x) 


0(a(fc + lga)) 


0((fc + l)lgcr) 


obj_rnkl(a, x) 


O(r) 


O(lga) 


obj_seli(a 3 x, j) 


0(r + s) 


O(lga) 


obj_minl(a:, x) 


0(r + s) 


O(lga) 


obj_accl(a 1 x, y) 


0(r + sfe) 


0((fe + l)lg<r) 



Table 1: Time complexity for the operations using BinRel-Str and BinRel- 
WT. The parameter k represents the size of the output for the access operators; 
one can consider k — 1 for the reductions given in Theorem [TJ 
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The next lemma solves an extended variant of a query called range _quantile 
in the literature, which was also solved with wavelet trees within the same 
complexity 20 . Note that the lemma gives also a solution within the same time 
complexity for labjnin, which in the literature |20j was called rangejnextjvalue 
and solved with an ad-hoc algorithm, within the same time. 

Lemma 2. BinRel-WT supports rel_sel_lab_maj (a, j, x, y) in O(lgtr) time. 

Proof. We first get rid of a by setting j <- j + rel_num(l, a — l,x,y) and 
thus reduce to the case a = 1. Furthermore we map x and y to the domain 
of S by p <— map (a; — 1) + 1 and q map(y). We first find which is the 
symbol /3 whose row contains the j-th element. For this sake we first find the 
P such that rank< ( 3_i(S', p, q) < j < rank< / 3(S', p, q). This is achieved in time 
O(lger) as follows. Start at the root v and set j' <— j. If rank (B v , p, q) > j, 
then continue to the left subtree with p <— raiik.Q(B v ,p — 1) + 1 and q 
ranko(i?,j, q). Else continue to the right subtree with j' <— j' — ranko(-B t ,,p, q), 
p <— ranki(i?„,p — 1) + 1, and y <— ranki(_B„, q). The leaf arrived at is /3. 
Finally, we answer unmap(select ( g(S', j' + raiik.p(S,p — 1))))- d 

The wavelet tree is asymmetric with respect to objects and labels. The 
transposed problem, rel_sel_obj_maj, turns out to be harder. We present, 
however, a polylogarithmic-time solution. 

Lemma 3. BinRel-WT supports rel_sel_obj_maj (a, /3, x, j) in 
0(min(lgn, lg j lg(/3 — a + 1)) lg a) time. 

Proof. Remember that the elements are written in S in object major or- 
der. First, we note that the particular case where [a,/3] = [l,u] is easily 
solved in O(lgcr) time, by doing j' <— j + reljnum(l, a, 1, x — 1) and return- 
ing (^[j'], unmap(j')). In the general case, one can obtain time 0(lgnlg a) 
by binary searching the column y such that reljnum(a, /3, x, y — 1) < j < 
rel_num(a, /3, x, y). Then the answer is (lab_sell(a, j — rel_num(o;, ft, x, y — 
l),y),y) (note that Lemma [2] already gives us lab_sell in time 0(lg<r)). 

To obtain the other complexity, we find the 0(lg(/3 — a + 1)) wavelet tree 
nodes that cover the interval [a,/?]; let these be v%, V2, We map po- 

sition p = map(x — 1) + 1 from the root towards those WjS, obtaining all the 
mapped positions Pi in 0{k + lger) time. [20j Now the answer is within the 
positions [pi,Pi +j — 1] of some i. We cyclically take each «j, choose the middle 
element of its interval, and map it towards the root, obtaining position q, cor- 
responding to pair unmap(g)). If rel_rnk_obj jmaj (a, /3, S[q], unmap(q)) — 
rel_num(a, j3, l,x — 1) = j, the answer is (S l [q],unmap(q)). Otherwise we know 
whether q is before or after the answer. So we discard the left or right interval 
in Vi. After 0(k\gj) such iterations we have reduced all the intervals of length 
j of all the nodes Vi, finding the answer. Each iteration costs O(lgcr) time. □ 

The next lemma solves a more general variant of a problem that was called 
prevLess in the literature, and also solved with wavelet trees [26]. We achieve 
the same complexity for this more general variant. Note this is a simplification 
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of rel_sel_obj jnaj that we can solve within time O(lga), whereas for general 
j we cannot. 

Lemma 4. BinRel-WT supports rel_min_obj jmaj (a, ft, 7, x) in 0(lgo~) time 
per pair output. 

Proof. We first use labjminl(7, x) (which we already can solve in time O(lger) 
as a consequence of Lemma [2]) to search for a point in the band [7,/?] x [a;, a;]. 
If we find one, then this is the answer, otherwise we continue with the area 
[a,j3] x [x + l,n]. 

Just as for the second solution of Lemma |3j we obtain the positions pi at 
the nodes Vi that cover [a, ft]. The first element to deliver is precisely one of 
those pi. We have to merge the results, choosing always the smaller, as we 
return from the recursion that identifies the nodes. If we are in v^, we return 
q = pi. Else, if the left child of v returned q, we map it to q' <— selecto(-B„, q). 
Similarly, if the right child of v returned q, we map it to q" <— selecti(B v ,q). 
If we have only q' (q"), we return q = q' {q — q"); if we have both we return 
q = min(g', q"). The process takes O(lger) time. When we arrive at the root 
we have the next position q where a label in [a, ft] occurs in S, and thus return 
unmap(g). □ 

The next result can also be obtained by considering the complexity of the 
BinRel-Str scheme implemented over a wavelet tree. 

Lemma 5. BinRel-WT supports obj_sell(a, x, j) in O(lgcr) time. 

Proof. This is a matter of selecting the j-th occurence of the label a in S, 
after the position of the pair (a,x). The formula is unmap (select „ (5", j + 
obj jrnkl(a, x — 1))). □ 

The next operation is the first of the set we cannot solve within polyloga- 
rithmic time. 

Lemma 6. BinRel-WT supports lab_num(o!, f3, x, y) in 0(f3 — a + lgc) time. 

Proof. After mapping [x,y] to positions S\p,q], we descend in the wavelet tree 
to find all the leaves in [a, ft] while remapping [p, q] appropriately. We count 
one more label each time we arrive at a leaf, and we stop descending from an 
internal node if its range [p, q] is empty. The complexity comes from the number 
of wavelet tree nodes accessed to reach such leaves (50] . □ 

The remaining operations are solved naively: lab_sel and obj_sel use, 
respectively, labjmin and obj_min successively, and objjnum and obj_rnk use 
obj_rnkl successively, 

The overall result is stated in the next theorem and illustrated in Figure [5] 

Theorem 2. The structure BinRel-WT, for a binary relation 1Z of t pairs 
over [l,o-] x [1, n], uses t lg a + 0(n + t) bits of space and supports the operations 
within the time complexities given in Tabled 
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Figure 5: Reductions among operations supported by BinRel-WT. The dot- 
ted boxes are operations supported in w(lg a) time, the filled boxes represent 
operations directly addressed in the paper, and the blank boxes are supported 
via reductions. 



Proof. The space assumes a plain uncompressed wavelet tree and bitmap rep- 
resentations, and the time complexities have been obtained throughout the sec- 
tion. □ 



6 Using a Generalized Wavelet Tree: BinRel- 
GWT 

The results we obtained for the wavelet tree can be extended to the generalized 



wavelet tree, improving complexities in many cases (recall Section 2.3 1. We 
refer to the structure that represents S using the generalized wavelet tree as 
BinRel-GWT, and we use /j, to represent the fan-out of the tree. We require 
fj, G 0(lg e (na)), assuming that the RAM machine can address up to n x a cells. 
To simplify we will assume a < n and then use simply /i € 0(lg e n). 

A first simple result stems from the fact that the string operations are sped 
up on generalized wavelet trees. 

Lemma 7. BinRel-GWT supports obj_rnkl and obj_sell in time Oilg^a) 
for any fi £ 6(lg e n) and any constant < e < 1. 

Proof. This follows directly from the results on general BinRel-Str structures. 

□ 
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The following notation will be useful to describe our algorithms within 
wavelet tree nodes. Note that all are easily computed in constant time. 



• child(fc): Given a symbol k £ [1,/i], this is the subtree labeled k of the 
current node. 

• g(a): Given a symbol a £ [l,c], g(a) is the symbol k such that child(fc) 
contains the leaf corresponding to a. 

• g _1 (fc): Given a symbol k £ [1,/-*], g ~ 1 (k) = min{a,fc = g(a)}. 

The next lemma shows how to speed up all the range counting operations. 
The result is known in the literature for nxn grids, even within n lg n(l + o(l)) 
bits of space [10] . 

Lemma 8. BinRel-GWT supports rel_rnk(a, x) in O(\g^o~) time, for any 
fx £ 6(lg e n) and any constant < e < 1. 

Proof. As in the case of BinRel-WT, we reduce this problem to the one of 
computing r£ink< Q ,(S', map(x)), which can be done by following a similar pro- 
cedure: We follow the path for a starting at the root in position p = map(x) 
and with a counter c 0. Every time we move to a subtree, we increase 
c 4— c + rank< g ( Q )_ 1 (S'„,p). When we arrive at the leaf, the answer is c + p. 
Operation rank<fc(S'„,p) can be solved in constant time for fi £ 0(lg c n) 



analogously as done for rankfc on small alphabets (recall Section 2.2 1. We store 
for each k £ [1>/*] a bitmap such that B<).[i] = 1 iff S v [i] < k. Thus 
rank<fe(S'„,p) = ranki(_B<fc,p) is computed in constant time and the whole 
process takes 0(lg M cr) time. □ 

The next lemma covers operation rel_sel_lab_maj , on which we cannot 
improve upon the complexity given by BinRel-WT. Note this means that 
O(lger) is still the best time complexity for supporting range-quantile queries 
within linear space [2"0~j . 

Lemma 9. BinRel-GWT supports rel_sel_lab_maj (a, j, x, y) in O(lger) 
time. 

Proof. This is solved in a similar way to the one presented for BinRel-WT. We 
find v such that rank<^_i(5',p, q) < v < rank^^, p, q). The only difference is 
that in this case we have to do, at each node, a binary search for the right child 
k £ to descend, and thus the time is 0(lg \i lg a) — O(lgcr). □ 

For the next operations we will augment the generalized wavelet tree with 
a set of bitmaps inside each node v. More specifically, we will add + 
l)/2 bitmaps -Bfc,;, where Bki[i] = 1 iff S v [i] £ [k,l]. Just as bitmaps S<fc, 
bitmaps B^ i are not represented explicitly, but only their index is stored (recall 



Section 2.2), and their content is simulated in constant time using S v . Their 
total space for a sequence is 0(r^ 2 lglgn/ lg n). To make this space 
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negligible, that is, o(nlgfi), it is sufficient that fi = 0(\g e n) for any constant 
< e < 1/2. (A related idea has been used by Farzan et al. [T7].) 

The next lemma shows that the current solution for operation prevLess 26 
can be sped up by an O(lglgn) factor. 

Lemma 10. BinRel-GWT supports rel_min_obj jnaj (a, j3, 7, x) in Oilg^a) 
time, for any \i £ @(lg c n) and any constant < e < 1/2. 

Proof. We first run query rel_min_obj_maj(7,/3,7,a;), and if the result is on 
column x, we report it. Otherwise we run query rel_min_obj_maj(o;,/3,a,x + 
1). This means that we can focus on a simpler query of the form 
rel_min_obj jnaj (a, /?, x), which finds the first pair in [a,/3] x [x,n], in object- 
major order. We map [x,n] to S[p, t] as usual and then proceed recursively on 
the wavelet tree, remapping p. At each node v, we decompose the query into 
three subqueries, and then take the minimum result of the three: 

1. rel_min_obj_maj(a,g _1 (g(a) + 1) — l,x) on node child(a); 

2. rel_min_obj_maj(g _1 (g(a:) + 1), g _1 (g(/3)) — 1, a?) on the same node v; 

3. reljriin_obj _maj(g (g(/3)), /3, x) on node child(/3). 

Note that queries of type 1 will generate, recursively, only 0(lg„ c) further 
queries of type 1 and 2, and similarly queries of type 3 will generate 0(lg„ cr) 
further queries of type 3 and 2. The only queries that actually deliver values 
are those of type 2, and we will have to take the minimum over 0(lg„ a) such 
results. 

A query of type 2 is solved in constant time using bitmap -B g ( Q )+i !g (m-i> by 
computing q = selecti(S g ( Q ,) +lig ( | g)_ 1 ,ranki(B g ( a ) +1>g (^)_ l! p - 1) + 1). This 
returns a position S v [q]. As we return from the recursion, we remap q in its 
parent in the usual way, and then (possibly) compare q with the result of a 
query of type 1 or 3 carried out on the parent. We keep the minimum q value 
along the way, and when we arrive at the root we return (S*[c/], unmap(c/)). □ 

For the next lemma we need a further data structure. For each sequence 



iSu[l,n], we store an RMQ structure (Section 2.4), using 0(n) = o{n\gp) bits 
and finding in constant time position of a minimum symbol in any range S v [i,j]. 
This results improves upon the result for query range jnext_value |20j . 

Lemma 11. BinRel-GWT supports reljiiin._lab_maj (a,x,y, z) in 0(\g^a + 
lg /i) time, for any \l G 6(lg e n) and any constant < e < 1. 

Proof. Again, we can focus on a simpler query rel_min_lab_maj (a, x, y). We 
map [x,y] to S[p,q] as usual, and the goal is to find the leftmost minimum 
symbol in S[p, q] that is larger than a. 

Assume we are in a wavelet tree node v and the current interval of interest is 
S v [p, q] . Then, if S v [p, q) contains symbol g(a) (which is known in constant time 
with rank g ( Q ,)(5, J ,p, q) > 0), we have to consider it first, by querying recursively 
the child labeled g(a). If this recursive call returns an answer p' , we return it 
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in turn, remapping it to the parent node. If it does not, then any symbol larger 
than a in the range must correspond to a symbol strictly larger than g(a) in 
S v \p, q]. We check in constant time whether there is any value larger than g(a) 
in S v [p, q], using rank< g ( Q )(5V,,p, q) < q — p + 1. If there is none, we return in 
turn with no answer. 

If there is an answer, we binary search for the smallest k £ [g(ct) + 1, n] such 
that rank<fc(S'„,p, q) > rank< g ( ce )(S' 1 ,,p, q). This binary search takes O(lg^) 
time and is done only once along the whole process. Once we identify the right 
k, we descend to the appropriate child and start the final stage of the process. 

The final stage starts at a node where all the local symbols represent original 
symbols that are larger than a, and therefore we simply look for the position 
m = RMQ(S v ,p,q), which gives us, in constant time, the first occurrence of 
the minimum symbol in S v , and descend to child S [to]. This is done until 
reaching a leaf, from where we return to the root, at position p' , and return 
(<S"[y/], unmap(p')). It is easy to see that we work 0(1) time on Oilg^a) nodes 
and O(lg^i) once. □ 

Lemma 12. BinRel-GWT supports rel_sel_obj_maj (a, /?, x, j) in 
0(min(lgn, lgj lg(/3 — a + l))lg <r) time, for any [i € 6(lg £ n) and any 
constant < e < 1/2. 

Proof. The complexities are obtained the same way as for BinRel-WT. The 
binary search over reljnum is sped up because BinRel-GWT supports this op- 
eration faster. The other complexity is in principle higher, because the interval 
[a,/3] is split into as much as 0(fi\g(f3 — a + 1)) nodes. However, this can be 
brought down again to 0(lg(/3 — a + 1)) by using the parent node v of each 
group of (up to fj,) contiguous leaves [k,l], and using selecti on the bitmaps 
Bkj of those parent nodes in order to simulate a contiguous range with all the 
values in [k, I]. So we still have 0(lg(/?— a + 1)) binary searches of O(lgj) steps, 
and now each step costs 0(lg M <r). □ 

Lemma 13. BinRel-GWT supports lab_num(a!, j3, x, y) in 0(/3 — a + lg^cr) 
time, for any fj, E 0(lg c n) and any constant < e < 1/2. 

Proof. We follow the same procedure as for BinRel-WT. The main difference 
is how to compute the nodes covering the range [a,/3]. This can be done in a 
naive way by just verifying whether each symbol appears in the range of S v , but 
this raises the complexity by a factor of /i. Thus we need a method to list the 
symbols appearing in a range of S v without probing non-existent ones. We resort 
to a technique loosely inspired by Muthukrishnan [55] . To list the symbol from a 
range [fc, I] that exist in S v [p, q], we start with the first symbol of the range that 
appears in S v [p,q]. This is obtained with p' = selecti (-Bfc.j, rank 1 (B ki i,p - 
1) + 1). If p' > q then there are no such symbols. Otherwise, let k' — S v [p']. 
Then we know that k' appears in S v [p, q] . Now we continue recursively with 
subranges [k,k' — 1] and [k' + 1,1]. The recursion stops when no p' is found, and 
it yields all the symbols appearing in S v [p, q] in O(l) time per symbol. □ 
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Figure 6: Reductions among operations supported by BinRel-GWT. The dot- 
ted boxes are operations supported in uj(\go~) time, the solid ones in O(lger) 
time, and the thick ones in time Oilg^a) or 0(lg„ cr + lg/x). The filled boxes 
represent operations addressed directly, and the blank boxes are supported via 
reductions. 



The remaining operations are obtained by brute force, just as with BinRel- 
WT. Figure [6] illustrates the reductions used. 

Theorem 3. The structure BinRel-GWT, for a binary relation 1Z oft pairs 
over [l,cr] x [l,n], requires ilgcr(l + o(l)) + 0(n + t) bits of space and supports 
the operations within the time complexities given in Table^ 

7 Binary Relation Wavelet Trees (BRWT) 

We propose now a special wavelet tree structure tailored to the representation of 
binary relations. This wavelet tree contains two bitmaps per level at each node 
v, B l v and B r v . At the root, B l v [l, n] has the x-th bit set to 1 iff there exists a pair 
(a, x) with a £ [1, Lc/2J], and B r v has the x-th bit set to 1 iff there exists a pair 
(a,x) with a e [ ^cx / 2J + l,c]. Left and right subtrees are recursively built on 
the positions set to 1 in B l v and B 1 U1 respectively. The leaves (where no bitmap 
is stored) correspond to individual rows of the relation. We store a bitmap 
B[l, a + t] recording in unary the number of elements in each row. See Figure [7] 
for an example. For ease of notation, we define the following functions on 
B, trivially supported in constant-time: lab(r) = 1 + ranko(-B, selecti(£?, r)) 
gives the label of the r-th pair in a label-major traversal of R; while its inverse 
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Operation 


BinRel-GWT 


BinRel-WT 


rel_num(a, /3, x, y) 


OQgc/lg lgn) 


0(lg<r) 


rel_rnk(a, x) 


0(lgc/lg lgn) 


0(lg<7) 


rel_rnk_lab_maj (a, x, y, z) 


0(lgo-/ lg lgn) 


0(lg<T) 


rel_sel_lab_maj (a. j, x 1 y) 


O(lga) 


0(lg<r) 


rel_min_lab_maj (a, x, y, z) 


0(lgo"/lg 'gn + lg lg") 


0(lg<r) 


rel_rnk_obj _maj (a. {3, 7, x) 


0(lgo'/lg lg") 


0(lg<7) 


rel_sel_obj jnaj (a, /3, x, j) 


Oflg j lg(/3 - a + 1) lg (7/ lg lg n) 


0(lgjlg(/3-a + l)lg a) 




0(lgnlg(T/lg lgn) 


O(lgnlga) 


rel_min_ob j jnaj (a , /3 , 7, x) 


0(lgo-/ lg lgn) 


O(lga) 


rel_acc(a, /3, a;, y) 


0((k+ l)l g( x/ lglgn) 


0((fe + l)lga) 


lab_num(a, /3, a;, y) 


Of/J-a + lgcr/lg lgn) 


0((3 - a + \ga) 


lab_rnk(a, x, y) 


O(o + lgCT/lglgn) 


0(a + Iga) 


lab_sel(a, j, x, y) 


0(j(lgff/lglgn + lg lg n)) 


0(]\g<j) 


lab_acc(a, x : y) 


0((fc+ l)(lger/lg lgn + lglgn)) 


0((k + 1) lg a) 


lab_min(a, x. y) 


0(lga/ lg lgn + lglgn) 


0(lg<7) 


obj_num(a, /3, x, y) 


0((y - z + l)lg<7/lg lgn) 


0((y-x + l)lg<x) 


obj_rnk(a, /3, x) 


0((y-x + l)lg<7/lg lgn) 


0({y- x+l)\ga) 


obj_sel(a, /3, x : j) 


0(.j'lgo71g lgn) 


O(jlga) 


obj_acc(a, /3, x) 


0((fc+ l)lg<x/ lglgn) 


0((fe + l)lga) 


obj_min(a, /3, x) 


0(lgc/lg lg") 


0(lg<7) 


lab_rnkl(a, x) 


0(lgc/ lg lgn) 


0(lg<T) 


lab_sell(a, j, x) 


O(lga) 


0(lg<7) 


lab_minl(a, j, x) 


C(lgcr/ lglgn + lglgn) 


0(lg<7) 


lab_accl(a. j, x) 


0((fc+ l)lg<x/ lglgn) 


0((fe + l)lga) 


obj_rnkl(a, x) 


0(lgo-/ lg lgn) 


0(lg<r) 


obj_sell(a. x, j) 


OQgu/lg lgn) 


0(lg<7) 


obj_minl(a, x, j) 


C(lgcr/ lg lgn) 


O(lga) 


obj_accl(a, x. j) 


0((fc + l)lg<r/ lglgn) 


0((fc + l)lga) 



Table 2: Time complexity for the operations for BinRel-GWT and BinRel- 
WT. The parameter k represents the size of the output for the access operators; 
one can consider k = 1 for the reductions shown in Theorem [T] 



poslab(a) = ranki(£>, selecto(-B, a)) gives the position in the traversal where 
the pairs for label a start. 

Note that, because an object x may propagate both left and right, the sizes 
of the second-level bitmaps may add up to more than n bits. Indeed, the last 
level contains t bits and represents all the pairs sorted in row-major order. As 
we will see, the BRWT has weaker functionality than our former structures 
based on wavelet trees, but it reaches space proportional to H(1Z). 

Lemmas [14] to [18] give a set of operations that can be supported with the 
BRWT structure. 

Lemma 14. BRWT supports reljnum(a, f3, x, y) in 0(f3 — a + Igcr) time. 

Proof. We project the interval [x,y] from the root to each leaf in [a,/3], adding 
up the resulting interval sizes at the leaves. Of course we can stop earlier if the 
interval becomes empty. Note that we can only count pairs at the leaves, not 
at internal nodes. □ 

Lemma 15. BRWT supports rel_min_lab_maj (a, x, y, z) in O (lgcr) time. 

Proof. As before, we only need to consider the simpler query 
rel_min_lab_maj (a, x, y). We reach the O(lgcr) wavelet tree nodes t!i,U2,... 
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Figure 7: Example of the BRWT for the binary relation in Figure [T] 



that cover the interval [cc, <r], and map [x,y] to all those nodes, in O(lgcr) time 
[20] . We choose the first such node, Vk, left to right, with a nonempty interval 
[x, y\. Now we find the leftmost leaf of Vk that has a nonempty interval [x, y], 
which is easily done in O(lgcr) time. Once we arrive at such a leaf 7 with 
interval [x,y], we map x back to the root, obtaining x', and the answer is 
{ 1 ,x'). ' □ 

Lemma 16. BRWT supports rel_min_obj_maj (a, f3, 7, x) mO(lger) time. 

Proof. As before, we only need to consider the simpler query 
rel_min_objjnaj(a,/?,a;). Analogously to the proof of Lemma |4j we cover 
[a, f3] with O(lger) wavelet tree nodes v\,V2, ■ ■ ■ , and map x to Xi at each such 
Vi, all in O(lgcr) time. Now, on the way back of this recursion, we obtain the 
smallest y > x in the root associated to some label in [a,/3]. In this process we 
keep track of the node Mj that is the source of y, preferring the left child in case 
of ties. Finally, if we arrive at the root with a value y that came from node Vj, 
we start from position x' — Xi at node Uj and find the leftmost leaf of i>i related 
to y. This is done by going left whenever possible (i.e., if B l v [x'] = 1) and right 
otherwise, and remapping x' appropriately at each step. Upon reaching a leaf 
7, we report (7,2/). □ 

Lemma 17. BRWT supports obj_sell(a, x,j) in O(lgcr) time. 

Proof. We map x — 1 from the root to x' in leaf a, then walk upwards the path 
from x' + j to the root and report the position obtained. □ 

Lemma 18. BRWT supports lab_num(a, ft, x, y) in 0(ft — a + lg a) time. 

Proof. We map [x,y] from the root to each leaf in [a,/3], adding one per leaf 
where the interval is non-empty. Recursion can also stop when [x, y] becomes 
empty. □ 
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Figure 8: Reductions among operations supported by BRWT. The dotted boxes 
are operations supported in oj(lgcr) time, the filled boxes represent operations 
directly addressed in the paper, and the blank boxes are supported via reduc- 
tions. 



The remaining complexities are obtained by brute force: rel_sel_lab_maj 
and rel_sel_obj_maj are obtained by iterating with rel_min_labjmaj and 
rel_min_obj_maj, respectively; and similarly lab_sel, obj_sel and lab_sell 
using labjnin, objjnin, and labjninl. Finally, as before obj_num and obj_rnk 
are obtained by iterating over obj_rnkl. We have obtained the following theo- 
rem, illustrated in Figure [8j 

Theorem 4. The BRWT structure, for a binary relation 1Z of t pairs over 
[1, a] x [1, n], uses lg(l + \ 2)H{1Z) + 0(t + n + a) bits of space and supports the 
operations within the complexities given in Table^ 

Proof. The operations have been obtained throughout the section. For the 
space, B is of length a + t. Thus 0(t + n + a) bits account for B and for the 
2n bits at the root of the wavelet tree. The rest of the bits in the wavelet tree 
can be counted by considering that each bit not in the root is induced by the 
presence of a pair. 

Each pair has a unique representative bit in a leaf, and also induces the 
presence of bits up to the root. Yet those leaf-to-root paths get merged, so that 
not all those bits are different. Consider an element x related to t x labels. It 
induces t x bits at t x leaves, and each such bit at a leaf induces a bit per level on 
a path from the leaf towards the single x at the rootj^] At worst, all the 0(t x ) 

3 For example, in Figure Pn object x = 4 is related to labels C and D (see also Figure Pljl . 
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bits up to level lgt x are created for these elements, and from there on all the t x 
paths are different, adding up a total of 0{t x ) + t x lg f- bits. Adding over all x 
we get 0(t) + J2 X txlgf-- This is maximized when t x — t/n for all x, yielding 
0(t) +tlg^ = H(Tl) +0(t) bits. 

Instead of representing two bitmaps (which would multiply the above value 
by 2), we can represent a single sequence B v with the possible values of the two 
bits at each position, 00, 01, 10, 11. Only at the root is 00 possible. Except for 
those 2n bits, we can represent the sequence over an alphabet of size 3 with a 
zero-order representation [23], to achieve at worst (\g3)H(lZ) +o(t) bits for this 
part while retaining constant-time rank and select over each B\ and B r v . (To 
achieve this, we maintain the directories for the original bitmaps, of sublinear 
size.) 

To improve the constant lg 3 to lg(l + V2), we consider that the zero-order 
representation actually achieves \B v \Hq(B v ) bits. We call B x the concatenation 
of all the symbols induced by x, t x — \B X \ < t x , and H x = \B X \H (B X ). 
Assume the t x bits are partitioned into toi 01's, tio 10's, and tn ll's, so that 
t x =i i+*io+2iii,4 =toi+*io+iii, andifs = t 01 lg f^+h lg h+hi lg 4*-. 

L U1 L ll 

As tn = {tx — t$i — iio)/2, the maximum of H x as a function of ioi arL d iio 
yields the worst case at t m — t w = ^-t x , so tn = (| — ^-)t x and £ x = 

(I + ^r)t x , where H x = lg(l + y/2% bits. This can be achieved separately for 
each symbol. Using the same distribution of 01's, 10's, and ll's for all x we add 
up to lg(l + V2)tlg <f + 0{t) = lg(l + V2)H(TZ) + 0{t) bits. (Note that, if 
we concatenate all the wavelet tree levels, the H x strings are interleaved in this 
concatenation.) □ 

Note that this is a factor of lg(l + V2) « 1.272 away of the entropy of 1Z. 

8 Conclusions and Future Work 

Motivated by the many applications where a binary relation 1Z between a labels 
and n objects arises, we have proposed a rich set of primitives of interest in 
such applications. We first extended existing representations and showed that 
their potential is very limited outside single-row or single-column operations. 
Then we proposed a representation called BinRel-WT, that uses a wavelet 
tree to solve a large number of operations in time O(lgcr). This structure has 
been already of use in particular cases, but here we have made systematic use of 
it and exposed its full potential. Furthermore, we have extended the results to 
generalized wavelet trees, to obtain the structure BinRel-GWT. This structure 
achieves 0(lgcr/lglgn) time for many operations. It had already been used for 
range counting and reporting |10) . but here we have extended its functionality 

Its 1 at the second leaf, for C, induces a 1 at its parent, for C-D, and a 1 at the root, for A-D. 
Its 1 at the third leaf, for E, induces a 1 at its parent for E-F and the 1 at the root for E-H. 
The fact that (4, C) £ Ti induces the creation of one column at the leaf for C and one at its 
parent. On the other hand, there are two pairs related to object 1, but they are merged at 
the second level and thus there is only one path arriving at the root. 
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Operation 


BRWT 


BinRel-WT 


rel_mim(a, (3. x, y) 


0(/3-a + l g( r) 


O(lga) 


rel_rnk(a, x) 


0{a + l gf r) 


O(lga) 


rel_rnk_lab_maj (a, x, y. z) 


0{a + lg<r) 


O(lga) 


rel_sel_lab_maj (a, j, x, y) 


0(j lga) 


O(lga) 


rel_min_lab_maj (a, y, z) 


O(lgo-) 


O(lgcr) 


rel_rnk_obj_maj (a, /3, 7, x) 


OC/3-a + lgcr) 


O(lga) 


rel_sel_obj_maj (a, /3, x, j) 


O(jlgCT) 


0(lgjlg(/3-a + l)l g0 -) 
O(lgnlgo-) 


rel_acc(a:. /3, x, y) 


0((fc + l)l g( 7) 


0((fc + l)lg<7) 


rel_min_obj_maj (a, /3, 7, x) 


0(\ga) 


O(lga) 


lab_num(a, /3, a;, y) 


0(/3 - u + \gcr) 


O(0-a+lgo-) 


lab_rnk(a:. x, y) 


0{a + lgcr) 


0(a + lgcr) 


lab_sel(a, j. x, y) 


O{j\go) 


O(jlga) 


lab_acc(a. x, y) 


0((fe + l)lg(7) 


0((fe + 1) lgc) 


lab_min(a, x, y) 


O(lgCT) 


O(lga) 


obj Jium(a. /3, x, y) 


0((y-x + l)l g( r) 


0((y-x + l)lg<r) 


obj_rnk(a, 0, x) 




0((j/-s + l)lg CT ) 


obj „sel(a, /3, x, j) 


0(jlga) 


O(jlgcr) 


obj_acc(a, (3, x) 


0((fc + l)lger) 


0((fe + l)lg<r) 


obj_min(a;, f3. x) 


O(lga) 


O(lga) 


lab_rnkl(a, x) 


O(lgo-) 


O(lga) 


lab_sell(a, j, x) 


0(jlg<7) 


O(lga) 


lab_minl(a, j, x) 


O(lgo-) 


O(lga) 


lab_accl(a, j, x) 


0((fc + l)lger) 


0((fe + l)lg<r) 


obj_rnkl(a, x) 


O(lgo-) 


O(lga) 


obj _sell(a, x, j) 


0(l g( 7) 


O(lga) 


obj _minl(a, x, j) 


O(lgo-) 


O(lga) 


obj _accl(a, x, j) 


0((fe + l)lg(7) 


0((fc + l)lg<7) 



Table 3: Time complexity for the operations for BRWT and BinRel-WT. The 
parameter k represents the size of the output for the access operators; one can 
consider k = 1 for the reductions shown in Theorem [T] 



to many other operations, so that its use as a replacement of the better known 
BinRel-WT structure improves the times achieved in various applications by 
a lg lg n factor. 

Some of those speedups have already been mentioned in the article, namely 
operations ranyejaextjvalue and prevLess (201 123 (range-quantile, on the other 
hand, is an interesting operation that has resisted our attempts to improve it; 
it is equivalent to rel_sel_lab_maj). Another example is finding the dominant 
points on a grid, that is, those (x,y) so that there is not another (x',y') with 
x' < x and y' < y. Using successive calls to rel_min_obj jmaj and restricting 
the next call to the area [x + 1, n] x [1, y — 1] where (x, y) is the last pair found, 
structure BinRel-GWT can find the dominant points in time 0(lgn/lglgn) 
per point, on an n x n grid. This improves upon the O(lgn) time achieved in 
the literature [3D]. 

The speedup on counting and reporting on grids is useful in various text 
indexing data structures. As an example, Arroyuelo et al. [1] describe a Lempel- 
Ziv compressed index that is able to find the occ occurrences of a pattern of 
length m in a text T[l, n] in time 0(m 2 + (to + occ) lg n). For this sake they use 
a 2-dimensional grid where 0(m) rel_rnk and 0(occ) rel_acc operations are 
carried out. By using our new BinRel-GWT structure, their time complexity 
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drops to 0(m 2 + (m + occ) Ign/ lglgn). As a second example, Claude and 
Navarro |16| use grammar-based compression to solve the same problem. Given 
a grammar of n rules and height h, they achieve search time 0(m(m + h) + 
hocc) lgn). They use a 2-dimensional grid where operations rel_acc, lab_sell, 
and obj_sell are carried out, and therefore using a BinRel-GWT structure 
their time is reduced to 0(m(m + h) + h occ) lg nj lg lg n). 

Despite the fact that our structures solve many of the queries we have pro- 
posed in polylogarithmic (and usually logarithmic) time, supporting others re- 
mains a challenge. In particular, we have no good solutions for label-level or 
object-level rank and select queries on ranges. 

Another challenge is space. All of the described structures use essentially 
tiger bits of space, where t is the number of pairs in the relation. While this is 
reasonable in many cases, it can be far from the entropy of the binary relation, 
H(TZ), for dense relations. We have proposed a variant, BRWT, that uses space 
within a multiplicative factor 1.272 of H(TV), yet its functionality is more limited: 
Apart from label-level and object-level queries, this structure does not support 
to efficiently count and select arbitrary points in ranges, albeit it can efficiently 
enumerate them. 

Since the publication of the conference version of this article [5J, a fol- 
lowup work [T7] used our representation BinStr-WT as an internal structure 
to achieve asymptotically optimum space, H (1Z) + o(H (JZ)) bits, and was able to 
solve query rel_rnk in time O(lgn), and queries rel_acc, rel_sel_lab_maj and 
rel_sel_obj_maj in time 0(\g 2 n), on n x n grids. Once again, using structure 
BinRel-GWT their times for rel_rnk and rel_acc can be divided by lglgn. 

It is therefore an open challenge to approach space H(1Z) as much as pos- 
sible while retaining the maximum possible functionality and the best possible 
efficiency. It is unclear which are the limits in this space/time tradeoff, although 
some lower bounds from computational geometry are useful. For example, we 
cannot count points in ranges faster than the BinRel-GWT structure within 
polylogarithmic space [51] . 

On the other hand, H(1Z) is a crude measure that does not account for 
regularities in the row/column distribution, or clustering, that arises in real-life 
binary relations. For example, structures BinRel-WT and BinRel-GWT can 
be compressed to the zero-order entropy of a string of length t, and this can in 
some cases be below H(1Z), which shows that this measure is not sufficiently 
refined. It would be interesting to consider finer measures of entropy and try to 
match them. 

There might also be other operations of interest apart from the set we have 
identified. For example, determining whether a pair is related in the transitive 
closure of 1Z is relevant for many applications (e.g., ancestorship in trees, or 
paths in graphs). Another extension is to d-ary relations, which would more 
naturally capture joins in the relational model. 

Finally, we have considered only static relations. Our representations do 
allow dynamism, where new pairs and/or objects can be inserted in/deleted from 
the waveleet trees [28]. Adding/removing labels, instead, is an open challenge, 
as it alters the shape of the wavelet tree. 



25 



A Formal Definition of Operations 

We formally define our set of operations. Figure [9] graphically illustrates some 
of them. 

• rel_acc(a,/3,x,y) = {(7, 2) eft, y€[a,0\ A z E [x,y]} 

• rel_sel_lab_maj (a,j, x, y) = j-th smallest pair of rel_acc(a, a, x, y) in 
order (a, x) < (Jj, y) a < j3 V (a = j3 A x < y) 

• rel_min_lab_maj (a, x, y, z) = under the same order, smallest pair of 
rel_acc(a, a, 2, y) U rel_acc(a+l, a, x, y) 

• rel_sel_objjnaj(a;,/3,x, j) = j-th smallest pair of rel_acc(a, /3, x, n) in 
order (a, x) < ((3 , y) 4^ x < y V (x — y A a < f3) 

• rel_min_obj jnaj (a, /3, 7, x) = under the same order, smallest pair of 
rel_acc(7, j3, x, x) U rel_acc(a, /3, x+1, n) 

• lab_acc(a, /3, x, y) = {7, 3z, (7, z) € rel_acc(a, /3, x, y)} 

• lab_accl(a, /3, x) = lab_acc(a, /?, x, x) 

• lab_sel(a, j, a;, y) = J-th smallest label of lab_acc(a, tr, x, y) 

• lab_sell(a, j, x) = lab_sel(a, j, x, x), or rel_sel_lab_maj (a, j, x, x) 

• lab_min(a, x, y) — lab_sel(a, 1, x, y) 

• labjninl(o!, x) = labjnin(a, x, x), or lab_sell(a, 1, x) 

• obj_acc(a,/?,x,y) = {z, 3j, (7, z) e rel_acc(a, /?, x, y)} 

• obj_accl(a, x, y) = obj_acc(a, a, x, y) 

• obj_sel(a, /?, x, j) — j-th smallest object of obj_acc(a, /3, x, n) 

• obj_sell(a, x, j) — obj_sel(a, a, x, j), or rel_sel_obj_maj (a, a, x, j) 

• obj jnin(a, /3, x) = obj_sel(a, /3, x, 1) 

• obj _minl(o!, x) = obj jnin(a, a, x), or obj_sell(o;, x, 1) 

• rel_mun(a, /?, x, y) = |rel_acc(a, /3, x, y)| 

• rel_rnk(a,x) = rel_num(l, a, 1, x) 

• rel_rnk_lab_maj (a, x, y, 2) = reljnum(l, a— 1, x, y)+rel_num(a;, a, x, z) 

• rel_rnk_obj_maj(a,/3,7,x) = rel_num(a, /3, 1, x— l)+rel_num(a, 7, x, x) 

• lab_num(a!, j3, x. y) = |lab_acc(a, /?, x, y)| 

• lab_rnk(a, x, y) = lab_num(l, a, x, y) 
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• lab_rnkl(a, x) = lab_rnk(a, x, x), or rel_num(l, a, x, x) 

• obj_num(a,/3, x,y) = |obj_acc(a, /?, x, y)\ 

• obj_rnk(a, /3, x) = obj_num(a, (3, 1, x) 

• obj_rnkl(o!, x) — obj_rnk(a, a, x), or rel_num(a!, a, 1, x) 

B Algorithms on BinRel-Str 

We prove some of the complexities in Table [I] The others can be derived using 
Theorem [TJ 

Lemma 19. BinRel-Str supports reljaum(a, f3, x, y) in 0(min((/3 — a + 
l)r, (y — x + l)alg/?)) time. 

Proof. We can compute rel jnum(o;, f3, x, y) in two ways: 

• Using that rel jnum(a, /3, x, y) = ^ Q<7</3 rel jnum(7, 7, x, y) and that 
rel_num(7, 7, x, y) = rank 7 (5, map(a; — 1) + 1, map(j/)), we achieve time 
0{{p-a + l)r). 

• Using that rel_num(a, /?, a:, y) — J2x<z<y reljnum(a, /?, z, z), we can com- 
pute the value for each z by searching for the successor of a and the pre- 
decessor of j3 in S'[map(z — 1) + l,map(z)]. As this range of S is sorted we 
can find the predecessors and successors using exponential search, which 
requires in 0(lg/3) access operations. Thus the overall process takes time 
0((y-x + l)alg/3). 

□ 

Lemma 20. BinRel-Str supports lab_num(a, f3, x, y) in 0(mm((/3 — a + 
l)r, (y — x + l)(lga + j3 — a)a)) time. 

Proof. We can compute labjrmm(a;, /3, x, y) in two ways: 

• Using that lab_num(a, /3, x, y) — 2 a <-y<a labjium(7, 7, x, y), and that 
lab_num(7, 7, x, y) = 1 iff rel_num(7, 7, x, y) > and zero otherwise, we 
can achieve the same time as in the first alternative of Lemma 1191 

• Using that lab_num(a, /3, x, y) = \ U x < z < y lab_acc(a, /9, z, z)\, we can col- 
lect the labels in [a, j3] x [2, z] for each z and insert them into a dictionary. 
The labels related to a single z can be found by using a similar method 
as the one in Lemma |19| use exponential search to find the first element 
> a in z's area of S, and then scan the next symbols until surpassing f3. 
We mark each label found in a bitmap of length ft — a + 1, and then we 
report the number of ones in it. 

□ 
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Binary Relation - Operations 



rel_num(a, /5, x, y) rel_acc(a, /?, x, i 

1 x V n x 

1 



rel_sel_lab _maj (a, j, x, y) 

x y 



rel_min_lab_maj (a, x, y, z) rel_sel_obj _maj (a, j3, x, j) rel jnin_obj _maj (a, f3, 7, x) 

y x x 



lab_acc(a, j3, x, y) 

x y 



lab_min(a, x, y) 

x 



lab_sel(a, j,x,y) 

x y 



obj_acc(a,/3,.x,i/) 

x y 



obj_min(a, j5, x) 




obj_sel(a,/3,x, j) 

x 



m 



Figure 9: Some operations illustrated. 
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Lemma 21. BinRel-Str supports obj_num(a, j3, x, y) in 0(min((y — x + 
\)a\ga, (j3 — a + l)(r + (y — x + l)s))) time. 

Proof. We can compute obj jnum(a, j3, x, y) in two ways: 

• Using that obj_num(a!, /3, x, y) — J2 x <z<y °bj -Jium(a, /3, z, z), and that 
obj_num(o!, /?, z, z) = 1 iff rel_num(o!, /3, z, z) > and zero otherwise, we 
can proceed similarly as in the second alternative of Lemma |19| by expo- 
nentially searching for the first value > a in z's area of S and checking 
whether it is < 0. 

• Using that obj_num(a, /?, x, y) = | U Q < 7 < ( g obj_acc(7, •y, x, y)\, we can col- 
lect the objects in [7, 7] x [ar, y] for each 7 and insert them into a dictionary, 
as in Lemma [20] The objects related to a single 7 can be found by using 
successive select 7 (S', j) operations on S'[map(x — 1) -I- l,map(?/)], starting 
with j — rank 7 (S', map(x — 1)) + 1. The complexity considers the worst 
case where each such 7 appears y — x + 1 times. 

□ 

Note that, when reducing to implement obj_rnk, the rank 7 (5, •) operation 
is not necessary. For obj_rnkl it is better to reduce from reljnum. 

Lemma 22. BinRel-Str supports rel_sel_lab_maj (a, j, x, y) in 0(min((<7 — 
a)r + s, (y — x + 1)(<7 — a + l)a)) time. 

Proof. Again, we have two possible solutions: 

• Set c 4— 0. For each label 7 in [a, a], compute d <— c + rank 7 (S f , map(x — 
1) + 1, map (y)). If at some step it holds c' > j, the answer is 
(7, unmap(select 7 (S', j — c + rank 7 (5, map(ir — 1)))). Otherwise, update 
c 4— c! . The overall process takes 0((a — a + l)r + s) time. 

• Similarly, but first accumulating all the occurrences of all the labels 7 
and then finding j using the accumulators. We simply traverse the area 
S'[map(z — 1) + l,map(z)] backwards, for each z G [#,2/], accessing each 
label S[k] and incrementing the corresponding counter. The process takes 
0((y — x + 1)(<7 — a + l)a) time. 

□ 

From this operation we can obtain complexities for rel_min_lab jmaj , 
lab_min, lab_minl, lab_acc, and lab_accl. Some of the results we give are 
better than those obtained by a blind reduction; we leave to the reader to check 
these improvements. We also note that an obvious variant of this algorithm is 
our best solution to compute lab_sel, within the same time. 

Lemma 23. BinRel-Str supports rel_sel_obj_maj(a, f3, x, j) in 0(min((n — 
x + l)alg/3, (/? - a+ l)((n -x+ l)s + r))) time. 

Proof. Once again, we have two possible solutions: 
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• Set c <— 0. For each object z in [x, n], use exponential search on z's area of 
5 to find the range S[a, b] corresponding to [a, (3], and set c' c+6— o+l. 
If at some step it holds c' > j, the answer is (S{j — c+a— 1], z). Otherwise, 
update c <— c'. The overall process takes 0((n — x + l)a\gf3) time. 

• Similarly, but first accumulating all the occurrences of all the objects z 
and then finding j using the accumulators. We traverse the area 5'[map(a; — 
1) + l,map(y)] for each label 7 in [a, (3], using successive select 7 (S', f) 
queries, starting at j' = rank 7 (S', map(a; — 1)) + 1. The process takes 
0(((3 -a + l)((n - x + l)s + r)) time. 

□ 

From this operation we can obtain complexities for rel_min_obj_maj, 
rel_acc, objjnin, objjninl, obj_acc, and obj_accl. Once again, some of 
the results we give are better than a blind reduction and we leave the reader to 
verify those. Finally, an obvious variant of this algorithm is our best solution 
to compute obj_sel. 

A final easy exercise for the reader is to show that lab_sell(o;, j, x) can be 
solved in time O(alga), and that obj_sell(a, x, j) is solved in time 0(r + s). 
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